Evaluating the Potential of Programmable Multiprocessor Cache Controllers

نویسندگان

  • John B. Carter
  • Mike Hibler
  • Ravindra R. Kuramkote
چکیده

The next generation of scalable parallel systems e g machines by KSR Convex and others will have shared memory supported in hardware unlike most current generation machines e g o erings by Intel nCube and Thinking Machines However current shared memory architectures are constrained by the fact that their cache controllers are hardwired and in exible which limits the range of programs that can achieve scalable performance This observation has led a number of researchers to propose building programmable multiprocessor cache controllers that can implement a variety of caching protocols support multiple communication paradigms or accept guidance from software To evaluate the potential performance bene ts of these designs we have simulated ve SPLASH benchmark programs on a virtual multiprocessor that supports ve directory based caching protocols When we compared the o line optimal performance of this design wherein each cache line was maintained using the protocol that required the least communication with the performance achieved when using a single protocol for all lines we found that use of the optimal protocol reduced consistency tra c by with a mean improvement of Cache miss rates also dropped by up to Thus the combination of programmable or tunable hardware and software able to exploit this added exibility e g via user pragmas or compiler analysis could dramatically improve the performance of future shared memory multiprocessors Evaluating the Potential of Programmable Multiprocessor Cache Controllers John B Carter Mike Hibler Ravindra R Kuramkote Department of Computer Science University of Utah

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

USC CENG Technical Report CENG - 94 - 15 THE USC MULTIPROCESSOR TESTBED PROJECT : PROJECT OVERVIEW

In multiprocessor systems, processing nodes contain a processor, some cache and a share of the system memory, and are connected through a scalable interconnect. The system memory partitions may be shared (shared-memory systems) or disjoint (messagepassing systems). Within each class of systems many architectural variations are possible. Comparisons among systems are difficult because of the lac...

متن کامل

Rpm: a Rapid Prototyping Engine for Multiprocessor Systems1

In multiprocessor systems, processing nodes contain a processor, some cache and a share of the system memory, and are connected through a scalable interconnect. The system memory partitions may be shared (shared-memory systems) or disjoint (messagepassing systems). Within each class of systems many architectural variations are possible. Fair comparisons among systems are difficult because of th...

متن کامل

The Flash Multiprocessor: Designing a Flexible and Scalable System

The choice of a communication paradigm, or protocol, is central to the design of a largescale multiprocessor system. Unlike traditional multiprocessors, the FLASH machine uses a programmable node controller, called MAGIC, to implement all protocol processing. The architecture of the MAGIC chip allows FLASH to support multiple communication paradigms — in particular, cache-coherent shared memory...

متن کامل

A rule-based evaluation of ladder logic diagram and timed petri nets for programmable logic controllers

This paper describes an evaluation through a case study by measuring a rule-based approach, which proposed for ladder logic diagrams and Petri nets. In the beginning, programmable logic controllers were widely designed by ladder logic diagrams. When complexity and functionality of manufacturing systems increases, developing their software is becoming more difficult. Thus, Petri nets as a high l...

متن کامل

Rapid Hardware Prototyping on Rpm-2: Methodology and Experience

Field-Programmable Gate Arrays is an emerging technology which promises easy hardware reconfigurability by software at low cost. Entire systems can be built in which some parts are programmable. Such systems implement various architectures. Each architecture prototype is a detailed hardware implementation of the architecture -including I/O-on which complex software systems can be ported. We hav...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994